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ABSTRACT 


We  present  a  theory  that  combines  order  of  magnitude  reasoning  with 
envisionment  of  qualitative  differential  equations.  Such  a  theory  can  be  used 
to  reason  qualitatively  about  dynamical  systems  containing  parameters  of 
widely  varying  magnitudes.  We  present  an  a  mathematical  analysis  of  envi- 
sionment over  orders  of  magnitude,  including  a  complete  categorization  of 
adjacent  pairs  of  qualitative  states.  We  show  how  this  theory  can  be  applied 
to  simple  problems,  we  give  an  algorithm  for  generating  a  complete  envision- 
ment graph,  and  we  discuss  the  implementation  of  this  algorithm  in  a  running 
program. 
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1.   Introduction 

Two  mathematical  techniques  that  have  been  found  particularly  useful  in  recent  work  on 
qualitative  physical  reasoning  are  the  solution  of  qualitative  differential  equations  through 
envisionment  and  order  of  magnitude  reasoning.  The  object  of  this  paper  is  to  combine  these 
two  theories  mathematically. 

Envisionment  and  qualitative  differential  equations  are  used  to  analyze  the  behavior  of 
dynamical  systems.  If  the  physical  structure  of  a  dynamic  system  can  be  characterized  in 
terms  of  relations  between  a  finite  set  of  state  variables  and  their  derivatives,  then  the  envi- 
sionment process  can  determine  at  least  partially  the  behavior  over  time  of  these  variables 
and  derivatives.  The  strength  of  the  envisionment  procedure  is  that  it  can  use  a  partial 
categorization  of  the  physical  relations  among  the  variables  to  derive  a  useful  partial  informa- 
tion about  the  behavior.  For  example,  consider  a  block  attached  to  a  spring,  moving  in  a 
straight  line  without  friction.  Given  the  weak  constraint  that  the  spring  always  exerts  a  force 
on  the  block  pointed  toward  its  rest  point,  the  envisionment  process  can  predict  that  the  block 
will  oscillate  around  the  rest  point.  Envisionment  has  been  applied  in  numerous  physical 
domains,  including  electronics,  mechanics,  hydraulics,  and  heat  transfer  ([de  Kleer  and 
Brown,  85],  [Forbus,  85],  [Williams,  85]).  The  clearest  exposition  of  the  mathematics  of  the 
theory  is  [Kuipers,  85].  Envisionment  suffers  from  a  number  of  limitations:  it  is  sometimes 
too  weak  mathematically  to  yield  important  inferences  ([Kuipers,  85],  [Struss,  87]),  and  its 
focus  on  differential  behavior  sometimes  forces  much  complex  but  useless  information  to  be 
generated  ([Davis,  86]).  Nonetheless,  for  many  types  of  physical  inference,  it  is  a  simple  and 
effective  mode  of  analysis. 

Order  of  magnitude  reasoning  is  concerned  with  the  analysis  of  physical  systems  in 
which  one  quantity  is  much  greater  than  another,  or  in  the  comparison  of  two  systems  of  the 
same  structure,  but  which  have  corresponding  quantities  of  very  different  magnitudes.  For 
example,  if  a  very  massive  block  hits  a  very  light  one,  order  of  magnitude  reasoning  can  be 
used  to  infer  that  the  massive  block  continues  on  its  way  unaffected,  while  the  light  block 
bounces  off  away  from  the  massive  one.  The  idea  of  order  of  magnitude  reasoning  is  to 
approximate  a  very  great  ratio  as  an  infinite  ratio,  and  then  to  analyze  the  system  in  terms  of 
an  algebra  of  infinite  and  infinitesimal  quantities.  Such  an  algebra  has  been  worked  out  in 
[Raiman,  86]  and  extended  in  [Dague,  Raiman,  and  Deves,  87]. 

The  natural  next  step  is  to  combine  these  two  modes  of  inference  so  that  we  may  reason 
about  the  dynamical  behavior  of  systems  with  quantities  of  widely  varying  magnitudes.  For 
example,  we  would  like  to  be  able  to  reason  that  a  very  heavy  block  on  a  spring  will  have  a 
much  longer  period  of  oscillation  than  a  much  lighter  block  on  the  same  spring.  This  paper 
presents  a  theory  combining  order  of  magnitude  reasoning  with  envisionment  of  qualitative 
differential  equations,  which  supports  such  inferences.  The  theory  introduces  two  technical 
innovations.  First,  each  state  of  the  system  is  labelled  with  its  duration  and  with  the  net 


The  problem  addressed  here  was  first  brought  to  my  attention  by  Dan  Weld.  I  thank  Dan  Weld, 
Olivier  Raiman,  Asher  Meth,  Ben  Kuipers,  Leo  Joskowicz,  and  Yumi  Iwasaki  for  helpful  discussions, 
and  for  their  criticisms  of  an  preliminary  draft  of  this  paper.  This  research  was  supported  by  NSF  grant 
DCR-8603758. 


change  to  each  parameter  during  the  state.  Second,  we  formulate  a  number  of  rules  govern- 
ing the  behavior  of  functions  over  orders  of  magnitudes.  In  other  respects,  each  of  our  com- 
ponent subtheories  is  weaker  than  standard  theories  in  the  literature.  Our  theory  of  envision- 
ment  allows  only  the  fixed  quantity  spaces  of  sign  and  order  of  magnitude,  not  arbitrary 
discretizations  of  quantity  spaces  as  in  [Kuipers,  85].  Our  order  of  magnitude  algebra  uses 
only  three  fixed  ranges  of  orders  of  magnitudes,  SMALL,  MEDIUM,  and  LARGE,  not  arbi- 
trarily many  as  in  [Raiman,  86]  and  [Dague,  Raiman,  and  Deves,  87].  ([Weld,  87]  presents 
an  alternative  method  for  making  this  inference,  which  does  not  assume  that  the  ratio 
between  the  blocks  is  infinite,  but  rather  does  a  careful  analysis  of  the  relation  between 
changes  in  in  parameters  and  their  derivatives.  This  theory  is  stronger  than  ours,  in  that  it 
requires  weaker  assumptions  in  the  input,  but  it  seems  to  be  less  generally  applicable.) 

The  paper  is  organized  as  follows:  Section  2  presents  fundamental  definitions  and  rules 
that  govern  the  behavior  of  a  solution  to  a  set  of  qualitative  differential  equations.  The  basis 
of  the  definitions  and  the  justifications  of  the  rules  lie  in  the  theory  of  non-standard  analysis 
with  infinitesimals.  Section  3  shows  how  these  rules  may  be  applied  in  the  "heavy  block  on 
the  spring"  example  and  other  examples.  Section  4  presents  an  algorithm  for  constructing  an 
envisionment  graph  from  a  set  of  qualitative  differential  equations.  Section  5  discusses  the 
CHEPACHET  program,  which  implements  this  algorithm.  Section  6  makes  suggestions  for 
further  work.  We  assume  that  the  reader  is  familiar  with  standard  theories  of  envisionment, 
as  in  [Kuipers,  85]  and  [de  Kleer  and  Brown,  85],  and  with  non-standard  analysis,  as  in 
[Robinson,  66]  and  [Davis  and  Hersch,  72]. 

2.    Theory 

We  follow  [Kuipers,  85]  in  structuring  our  theory. 

Quantities,  derivatives,  and  time  may  take  values  from  within  the  non-standard  real  line 
R*.  We  divide  this  line  into  seven  disjoint  intervals:  (We  use  notation  from  [Dague,  Raiman, 
and  Deves].) 

ZERO  =  {0} 

SMALL  =  {  X|X>0  and  A'«l  }  (infinitesimals) 

MEDIUM  =  {  X|X>0  and  X~l  }  (standard  reals) 

LARGE  =  {  X  |X>0  and  1«X  }  (infinitely  large  reals) 

-  SMALL  =  {  X  I  -X  €  SMALL  } 

-MEDIUM  =  {X|-X€MEDIUM  } 

-LARGE  =  {X|-X€LARGE  } 

These  sets  and  their  unions  are  the  qualitative  sets.  Note  that  SMALL  and  LARGE  span 
many  orders  of  magnitude  while  MEDIUM  spans  only  one  order  of  magnitude.  Note  also 
that  there  is  no  landmark  value  separating  SMALL  from  MEDIUM  or  MEDIUM  from 
LARGE;  in  non-standard  analysis,  there  is  no  largest  infinitesimal  or  smallest  positive  stan- 
dard real.  Our  theory  can  easily  be  extended  to  incorporate  any  finite  number  of  orders  of 
magnitude;  we  might,  for  example,  have  five  positive  orders  VERY_SMALL,  SMALL, 
MEDIUM,  LARGE,  VERY_LARGE.  It  is  not  clear  how  to  extend  the  theory  to  infinitely 
many  different  orders,  as  in  [Dague,  Raiman,  and  Deves,  87]. 

Table  1  shows  the  basic  arithmetic  operations  on  the  qualitative  values.  These  follow 
directly  from  the  axioms  presented  in  [Raiman,  86].  Following  the  notation  of  [de  Kleer  and 
Brown,  85]  and  [Kuipers,  85]  we  define  [X]  to  be  the  qualitative  set  containing  the  value  X; 
for  example  [-1]  =  -MEDIUM,  [0]  =  ZERO.  If  P  (r)  is  a  function  of  T,  then  we  will  use 
the  notation  dP  for  the  qualitative  value  of  the  derivative;  that  is,  dP(T)=[P(T)]. 
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Bracketed  pairs,  above,  desiqnate  intervals.  For  example  [-M,M] 
is  the  set  {-M.EDILIM,  -SMALL,  '  ZERO,  S:4ALL,  MEDIUM}.  [-L,-S]  is 
the  set  [-LARGE,  -MEDIUM,  -SMALL  }. 


Table  1:  Addition  and  Multiplication  of  Qualitative  Sets, 


An  interval  of  R*  is  a  set  of  points  /  such  that,  for  all  X.yC/,  if  X<Z<Y  then  Z€/.  An 
interval  /  is  closed  on  the  left  (right)  if  /  has  a  greatest  lower  bound  (least  upper  bound)  X, 
and  X  is  in  /.  It  is  open  on  the  left  (right)  if  it  has  a  greatest  lower  bound  (least  upper  bound) 
X,  and  X  is  not  in  /  or  if  it  is  unbounded  on  the  left  (right).  Thus,  the  interval  ZERO  is 
closed  on  both  left  and  right;  SMALL  is  open  on  the  left  and  neither  open  nor  closed  on  the 
right;  MEDIUM  is  neither  open  nor  closed  on  either  side;  and  LARGE  is  neither  open  nor 
closed  on  the  left  and  open  on  the  right.  Two  intervals  /  and  J  are  adjacent  if  they  are  disjoint 
and  /(J/  is  an  interval. 

We  will  be  interested  in  tracing  the  qualitative  values  of  a  number  of  parameters  over 

time  p  i(7"),  p2(^).  '  '  •  Pt(r)  and  their  derivatives  p  i(r),  P2(^).  '  '  '  Pk(T).  The  parameters 
and  time  take  their  values  from  the  non-standard  real  line.  The  parameters  are  assumed  to  be 
C';  that  is,  they  are  continuous,  and  their  derivatives  are  continuous.  A  collection  of  k  such 
functions  is  called  a  behavior. 

We  define  the  inertial  state  of  the  parameters  p  ]{T)  ■  ■  ■  Pk(T)  to  be  the  2ik-tuple  of  the 
qualitative  states  of  all  the  parameters  and  their  derivatives: 

S=  <  [pdT)]  ■  ■  ■  b  *(?■)].  [Pi  (7-)]  •  •  •  [Pk(T)]  > 

An  inertial  state  S  characterizes  a  behavior  P  =  <p  i(r)  •  •  •  Pk{T)>  over  a  time  interval  /  if  5 
is  the  inertial  state  of  P  for  all  T  in  /. 

An  inertial  subdivision  of  an  interval  /  by  a  behavior  P  is  a  set  J  of  subintervals  of  /, 
where  /a€2iff  the  following  conditions  hold: 

i.        /„  C  / 

ii.       There  is  a  single  inertial  state  So.  that  characterizes  P  throughout  la- 

iii.      /„  is  maximal  with  respect  to  properties  (i)  and  (ii). 

It  should  be  clear  that,  for  any  interval  /  and  behavior  P ,  there  is  a  unique  subdivision; 
that  the  intervals  of  the  subdivision  are  disjoint;  and  that  their  union  is  /. 

We  are  also  interested  in  the  order  of  magnitude  of  intervals  and  of  the  change  to  a 
parameter  over  an  interval.  Let  P  =  <p\{T)  ■  ■  ■  Pk{T)>  be  a  behavior  and  let  /  be  a  time 
interval.  We  define  the  time  duration  of  /,  written  AT(I),  to  be  the  maximum  over  all  points 
Tx,T2^I  of  [7"2-ri].  We  define  the  variance  of  parameter  p(r)  over  /,  written  ^p(I),  to  be 
the  maximum  over  all  7",,7'2€/  of  b(^i)~P(^2)] 

A  specification  of  the  qualitative  values  [p,],  dp,,  Ap,,  and  AT  is  called  a  qualitative 
state.  We  can  now  state  a  number  of  basic  rules  that  relate  the  qualitative  states  of  adjacent 
intervals  in  an  inertial  decomposition.  Let  P  =  <p  i{T)  ■  ■  ■  pk(T)>  be  a  behavior;  let  p  be  a 
parameter  in  P;  let  /  be  an  interval;  and  let  A  and  B  be  two  adjacent  intervals  in  the  inertial 
subdivision  of  /  by  P.  Then  the  following  rules  hold: 

Change:  There  is  a  parameter  p  such  that  either  [p(A)]i^[p  (B)]  or  dp{A)i' dp{B).  That 
is,  something  changes  from  A  to  B. 

Continuity:  From  A  to  B,  [p]  and  dp  must  either  stay  constant,  or  go  from  one  qualita- 
tive set  to  a  bordering  set,  in  the  ordering  -LARGE,  -MEDIUM,  -SMALL,  ZERO, 
SMALL,  MEDIUM,  LARGE.  (Note  that  variances  and  time  durations  do  not  have  to  change 
continuously.) 

Mean  value  from  ZERO:  If  [p]=ZERO  in  A  and  dp>0  in  A,  then  [p]  =  SMALL  in  B. 
If  [p]  =  ZERO  in  A  and  ap<0  in  A,  then  [p  ]= -SMALL  in  fl. 

Mean  value  to  ZERO:  If  [p]  =  ZERO  in  B  and  dp<0  in  B,  then  [p]=SMALL  in  A.  If 
[p]=ZERO  InB  and  dp>0  in  S,  then  [p]= -SMALL  in  A. 

Mean  value  from  non-zero:  If  [p  ]  is  greater  in  B  than  in  A,  and  is  not  equal  to  ZERO 
in  either  state,  then  dp  must  be  positive  in  both  A  and  B.  If  [p]  is  less  in  B  than  in  A,  and  is 
not  equal  to  ZERO  in  either  state,  then  dp  must  be  negative  in  both  A  and  B. 


Variance  over  time:  The  variance  of  p  in  A  is  the  absolute  value  of  the  derivative  of  p 
times  the  duration. 

Ap  C  abs(ap)  X  AT" 

Variance  bound:  The  variance  of  a  parameter  is  never  greater  than  its  absolute  value. 

Ap  <  abs([p]) 

Variance  at  state  change:  If  \p]  changes  from  A  to  S  then  A/7(i4)  =  abs([p(fl)]),  and 
Ap(A)=abs([p(S)])  Note  the  contrapositive  of  this  statement:  If  A/j  in  A  is  small  as  com- 
pared to  [p  ]  in  i4,    then  \p\  does  not  change  from  A  to  B. 

No  successive  instants:  A  and  B  cannot  both  have  duration  ZERO. 

Temporal  topology:  By  the  rule  of  continuity,  any  transition  of  a  parameter  or  deriva- 
tive p  from  A  to  fi  has  one  of  the  following  forms: 

i.  p  (A)  =  ZERO  and  p(B)=±  SMALL; 

ii.  p(A)=  ±SMALL  and  p(B)=ZERO; 

iii.  p  (A)  =±  MEDIUM  and  p(S)=  ±  SMALL  or  ±  LARGE; 

iv.  p(A)=± SMALL  or  ± LARGE  and  p  (S)=  ± MEDIUM; 

The  rule  of  temporal  topology  states  that,  going  from  one  interval  A  to  the  next  interval  B, 
only  one  of  these  types  of  transitions  can  occur.  Thus,  for  example,  it  is  possible  to  have  p 
change  from  SMALL  in  A  to  MEDIUM  in  B  and  have  q  change  from  -  LARGE  in  A  to 
-MEDIUM  in  B,  since  these  are  both  transitions  of  type  (iv).  It  would  not  be  possible  to 
have  p  change  from  SMALL  in  A  to  MEDIUM  in  B,  and  q  change  from  ZERO  in  A  to 
SMALL  in  S,  since  the  first  is  a  transition  of  type  (iv),  while  the  second  is  a  transition  of  (i). 

The  fundamental  reason  for  this  rule  is  that  different  types  of  transitions  imply  different 
topologies  for  the  intervals.  If  a  transition  of  type  (i)  occurs  from  A  to  B,  then  A  must  be 
closed  on  the  right,  and  B  must  be  open  on  the  left.  If  a  transition  of  type  (ii)  occurs,  then  A 
must  be  open  on  the  right  and  B  must  be  closed  on  the  left.  If  a  transition  of  type  (iii)  or  (iv) 
occurs,  then  A  must  be  neither  open  nor  closed  on  the  left,  and  B  must  be  neither  open  nor 
closed  on  the  right.  We  can  show  that  transitions  of  type  (iii)  cannot  occur  together  with  tran- 
sitions of  type  (iv)  by  the  following  argument:  Suppose  that  p  (7")  goes  from  MEDIUM  in  A 
to  LARGE  in  B ,  and  that  q  {T)  goes  from  SMALL  in  A  to  MEDIUM  in  B.  Then  the  continu- 
ous function  p  (T")-^  (7)  would  go  from  SMALL  in  A  to  LARGE  in  B,  which  is  impossible. 

Forever  is  a  long  time:  If  A  is  unbounded  on  the  left  or  on  the  right,  so  that  it  has  no 
adjacent  following  state  or  no  adjacent  preceding  state,  then  AT" (A)  =  LARGE. 

The  rules  of  change,  continuity,  mean  value,  and  no  successive  instants  are  the  same  or 
direct  analogues  of  standard  rules  from  the  ordinary  theory  of  QDE's  ([Kuipers,  85],  [Willi- 
ams, 85]).  William's  "Epsilon  Transition"  rule,  that  a  transition  to  zero  must  take  finite 
time,  is  a  consequence  of  the  rules  above:  If  p  is  non-zero  in  A  and  zero  in  B,  then  by  the 
rule  of  "Variance  at  state  change,"  Ap  is  non-zero  in  A,  so  by  the  rule  of  variance  over  time, 
AT  cannot  be  zero  in  A. 

These  rules  together  are  a  sufficient  characterization  of  order  of  magnitude  of  values, 
derivatives,  variances  and  durations  in  adjecent  intervals.  That  is,  given  two  qualitative  states 
S/^  and  Sb  satisfying  the  above  rules,  other  than  the  "Forever"  rule,  one  can  find  a  behavior  P 
and  two  adjacent  intervals  A  and  B,  such  that  Sa  is  the  qualitative  state  of  P  in  A,  and  Sb  is 
the  qualitative  state  of  P  in  B.  Given  a  state  A  satisfying  the  rules  of  "Variance  over  time", 
"Variance  bound"  and  "Forever",  there  is  a  behavior  which  stays  with  qualitative  state  A  over 
all  time.  We  omit  the  proof  of  this;  it  involves  a  long  but  straightforward  case  analysis, 
enumerating  all  the  possible  values  of  any  parameter  over  the  two  states,  constructing  partic- 
ular functions  that  exhibit  each  of  them,  and  showing  that  any  pair  of  transitions  allowed 
together  by  the  "Transition  topology"  rule  will  occur  simultaneously  for  our  sample  functions. 
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A  stronger  hypothesis  seems  plausible,  but  we  have  no  proof: 

Hypothesis  1:  Given  any  finite  sequence  of  qualitative  states  such  that  any  two  succes- 
sive qualitative  states  satisfy  the  above  rules,  there  exists  a  behavior  and  an  interval  such  that 
the  qualitative  states  characterize  the  behavior  over  the  interval. 

So  far,  we  have  been  speaking  about  the  space  of  all  possible  behaviors.  For  any  partic- 
ular problem,  we  will  be  working  with  qualitative  constraints  on  the  problem.  A  homogene- 
ous system  of  first-order  qualitative  constraints  (abbreviated  QC's)  is  a  set  of  relations  among 
the  orders  of  magnitudes  of  the  parameter  values  and  their  derivatives.  ("Homogeneous" 
here  refers  to  the  absence  of  an  explicit  time  dependence.)  That  is,  such  a  system  specifies 
certain  combinations  of  the  values  [p,(r)]  and  dp,{T)  to  be  physically  possible,  and  others  to 
be  impossible.  For  example,  we  might  specify  the  relation  dX=  -  [X];  this  states  that  the  only 
qualitative  states  allowed  are  [X]=LARGE,  aX= -LARGE;  [X]  =  MEDIUM, 
dX=  -  MEDIUM,  etc.  A  behavior  P  satisfies  a  set  of  QC's  only  if  the  orders  of  magnitude  of 
its  parameters  and  their  derivatives  satisfy  the  set  at  each  instant  of  time. 

Thus,  if  a  behavior  satisfies  a  set  of  QC's,  then  all  its  inertial  states  satisfies  the  QC's, 
and  any  two  adjacent  qualitative  states  satisfy  the  rules  enumerated  above.  If  Hypothesis  1 
above  is  true  then  the  converse  of  this  statement  is  likewise  true: 

Hypothesis  2:  Given  a  set  of  QC's  and  a  finite  sequence  of  qualitative  states,  if  each 
successive  pair  of  states  satisfies  the  rules  above  and  the  inertial  state  of  each  qualitative  state 
satisfies  the  QC's,  then  there  exist  a  behavior  P  and  a  time  interval  /  such  that  the  qualitative 
states  characterize  P  over  /,  and  such  that  P  satisfies  the  QC's  throughout  /. 

Proof  given  hypothesis  1:  Use  hypothesis  1  to  construct  a  behavior  P  and  an  interval  / 
following  the  qualitative  states.  Then  P  satisfies  the  QC's  since  the  states  do. 

A  particularly  important  class  of  qualitative  constraints  are  the  qualitative  differential 
equations  (abbreviated  QDE's).  Let  Fj,  •  •  •  ,Ft  be  t  continuous  t-place  functions  over  the 
non-standard  real  line.  Then  the  system  of  (exact)  differential  equations 

PdT)  =  F,{p,{T),pj{T)  ■  ■  -p.iT)) 
P2iT)  =  F2(p,iT),p2(T)---p,(T)) 

PkiT)  =  F,(p,(r),p2(7-)  •  •  -ptiT)) 

is  guaranteed  to  have  C'  solutions  for  any  given  initial  values  Pi(0)  •  •  ■  Pk(.0).  We  now 
discretize  the  functions  F,  to  multi-valued  functions  G,  over  qualitative  sets.  That  is,  if 
Ci  ■  ■  ■  G*  are  qualitative  values,  we  define  G,(Ci  ■  ■  •  C/.)  by  considering  all  possible  choices 
of  JC]  •  •  •  xjt  from  Qi  ■  ■  ■  Qk,  and  taking  the  qualitative  value  of  F,(xi  •  •  ■  x^.). 

G,(ei  •  •  •  Qk)  =  {[F,{xx  ■  ■  ■  x^)]  I  XiCCi  •  •  •  x^eej 

Thus,  for  example,  if  F(X,Y)  =  X^  +  Y  then  G(SMALL,MEDIUM)  =  SMALL^  +  MEDIUM 
=  {MEDIUM}.  G(MEDIUM,- MEDIUM)  =  MEDIUM^  -  MEDIUM  =  {-MEDIUM, 
-SMALL,  ZERO,  SMALL,  MEDIUM}. 

The  condition  that  the  F,  be  continuous  implies  conditions  on  the  G,  analogous  to  the 
rule  of  continuity  and  the  rule  of  temporal  topology  given  above. 

A  set  of  QDE's  is  then  the  discretized  form  of  the  original  exact  differential  equations. 

dPi(T)  €  G,([p,(r)]-  •     [p.iT)]) 

ap2(r)  €  G2([p,(n]- • -[p^cni) 

dp,(T)  e  G,(b,(7-)]-  •  -[PkiT)]) 

Note  that  there  can  be  behaviors  that  satisfy  some  system  of  QDE's  but  do  not  satisfy 
any  system  of  homogeneous  exact  differential  equations.  For  instance,  any  solution  of  a  set 
of  homogeneous  exact  differential  equations  that  goes  through  the  values  <0,  0,  0  ...  0> 
more  than  once  must  be  cyclic,  by  the  uniqueness  of  solutions  to  the  initial  value  problem; 
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this  docs  not  hold  for  solutions  to  homogeneous  QDE's. 

3.    Sample  Inferences 

We  now  show  how  these  rules  can  be  used  to  prove  that  a  heavy  block  on  a  spring  will 
take  a  long  time  to  reach  the  rest  point. 

Let  X  be  the  displacement  of  the  block  from  its  rest  point,  let  v  be  its  velocity,  and  let  / 
be  the  force  exerted  by  the  spring  on  the  block.  The  physical  constraints  on  the  system  can  be 
characterized  by  the  following  QDE's 

dx  =  [v] 
[/•]  =  LARGE  •  dv 

The  first  equation  just  establishes  v  as  the  derivative  of  x.  The  second  equation  corresponds 
to  Newton's  second  law  f=ma.  In  this  case,  m  is  assumed  to  be  LARGE,  and  a  =  dv.  The 
third  equation  is  a  very  weak  form  of  Hooke's  law;  it  states  that  the  spring  exerts  a  force 
opposite  to  the  displacement  and  of  the  same  order  of  magnitude.'  For  convenience  of  expo- 
sition, we  will  eliminate  /and  rewrite  the  equations  in  the  equivalent  form: 

dx  =  [v] 
dv  =  [x]/LARGE 

Nothing  in  our  analysis  depends  on  this  rewriting;  it  would  go  through  in  the  same  way  in 
either  presentation. 

We  now  track  the  sequence  of  qualitative  states  consistent  with  the  QDE's  and  with  the 
rules  enumerated  above,  and  we  show  that  any  path  which  brings  the  block  to  its  rest  point 
includes  a  state  whose  duration  is  LARGE. 

We  start  with  the  block  at  some  finite  displacement  and  zero  velocity.  That  is,  in  state 
Al,  [x]  =  MEDIUM,  [v]  =  ZERO.  From  the  differential  equations,  dx=v  must  be  ZERO, 
while  3v=  -x/LARGE  must  be  —SMALL.  By  variance  bound,  Av  must  be  ZERO.  By  vari- 
ance over  time,  therefore,  AT"  =   -| — r  =  ZERO.  Applying  variance  over  time  in  the  other 

\dv\ 

direction,  Ax  =  LTdx  =  ZERO.  So  we  have  a  complete  description  of  Al: 

Al:        [x]  =  MEDIUM        dx  =  ZERO  Ax  =  ZERO 

[v]  =  ZERO  av  =  -SMALL         Av  =  ZERO 

AT-  =  ZERO 

Let  A2  be  the  next  state.  Since  [x]  =  MEDIUM  and  Ax  =  ZERO  in  Al,  by  variance 
at  state  change  [x]  must  be  MEDIUM  in  A2.  By  mean  value  from  zero  and  continuity,  [v] 
must  be  —SMALL.  From  the  differential  equations,  dx  and  dv  must  both  be  —SMALL.  We 
cannot  determine  the  variances  or  duration  until  we  look  at  the  next  state. 

Let  A3  be  the  next  state.  In  A3,  either  [x],  [v],  dx  or  dv  must  change.  In  fact,  since  the 
differential  equations  determine  dx  and  dv  from  [x]  and  [v],  either  [x]  or  [v]  must  change. 
By  mean  value  at  non-zero  and  continuity,  we  must  either  have  [v]  change  to  —MEDIUM,  or 
have  [x]  change  to  SMALL  or  both.  We  will  consider  each  of  these  in  turn: 

Suppose  that  [v]  changes  to  -MEDIUM  in  A3.  Then,  by  the  rule  of  variance  at  state 
change,  we  must  have  Av  =  MEDIUM  in  A3.  As  we  have  stated,  [x]  must  either  MEDIUM 
or  SMALL  in  A3.  Hence,  from  the  differential  equation  3v=[x]/LARGE  must  be  -SMALL 

Av 
in  A3.  By  variance  over  time,  in  A3,  Ar=T — r=  LARGE.  But,  from  the  differential  equa- 

^  \dv\  ^ 

1.  It  may  be  asked,  what  does  it  mean  to  compare  the  order  of  magnitude  of  a  force  with  that  of  a  dis- 
tance?. The  answer  is  that  we  are  assuming  here  standard  units  of  mass,  distance,  and  time.  Thus,  the 
question  being  addressed  is  actually  a  comparative  one:  assuming  the  mass  is  large  compared  to  a  stan- 
dard mass,  what  does  that  imply  about  the  time,  compared  to  a  standard  time? 


tions,  dx=[v]=  MEDIUM,  and,  by  variance  over  time,  Ax  =  AT-\dx\  =  LARGE.  But  this 
contradicts  the  rule  of  bounded  variance,  since  Ax  is  now  larger  than  [x].  Thus  the  assump- 
tion that  [v]  changes  to  —MEDIUM  must  have  been  false. 

The  only  option  left  for  A3,  therefore,  is  that  [x]  changes  to  SMALL,  and  that  [v] 
remains  -SMALL.  From  the  differential  equations,  we  have  dx  =  -SMALL,  and  dv  = 
-SMALL.  Applying  the  variance  at  state  change  to  A2  and  A3,  we  must  have  that  in  A2,  Ax 
=   MEDIUM,  and  in  A3,  Ax  =   SMALL.  Therefore,  by  variance  over  time  in  A2,  we  have 

AT=  -. r  =  LARGE.    Combining  bounded  variance  and  variance  over  time,  we  find  that, 

\dx  I 
in  A2,  Av=  SMALL.  We  have  now  completely  characterized  A2: 

A2:        [x]  =  MEDIUM        dx  =   -SMALL         Ax  =  MEDIUM 
[v]  =   -SMALL        dv  =   -SMALL         Av  =  SMALL 
AT  =  LARGE 

Thus,  we  have  shown  that,  before  the  block  can  reach  the  zero  point,  it  must  pass 
through  a  state  of  LARGE  duration,  which  was  the  desired  result. 

We  cannot  show  in  this  system  that  the  block  ever  will  reach  the  rest  point.  Since  AT"  = 
LARGE  in  A2,  it  is  consistent  with  our  rules  that  the  system  should  stay  in  A2  forever.  In 
fact,  the  behavior 

x(7-)=l  +  ^-*''' 
v(7-)  =  6(<r-*'"-l) 

where  6  is  a  SMALL  real  number,  starts  in  Al  at  T=0,  and  then  stays  in  A2  forever.  (See 
figure  1.) 

m 


xCr) 


Figure  1:  A  behavior  that  stays  in  A2  forever 

By  contrast,  solutions  to  the  corresponding  second  order  QDE  [x{T)]=  -  [x(r)]/LARGE 
cannot  stay  in  A2  forever;  x(T)  must  cross  the  value  ZERO.  We  can  make  the  following 
argument  by  contradiction.  Suppose  x(T)  is  always  positive.  Then  x(T)  will  always  be  nega- 
tive, so  x(r)  will  be  uniformly  decreasing.  Thus,  if  x(r)  attains  a  value  -e  at  some  time  71, 
it  will  have  value  less  than  or  equal  to  -  e  for  all  T>T\.    However,  for  T>T\, 

T 

x{T)-x{T\)  =   SxiT)dT  <  -f.{T-Tl) 

T\ 

Thus  x(7")  will  be  negative  for  all  7>7"l-i-x(ri)/€,  contradicting  the  assumption.  Thus, 
information  is  irrecoverably  lost  in  converting  the  higher-order  QDE  into  a  first-order  QDE. 
(Nothing  in  this  argument  depends  on  the  order  of  magnitude  mechanism.  The  same  loss  of 
information  happens  in  converting  the  ordinary  second-order  QDE  [x(7")]=  -  [x(7")]  to  the 
two  first  order  QDE's  ax(7)=  [v(7-)],  dv(T)=  -[x(r)].) 

Our  rules  also  do  not  allow  us  to  say  much  after  what  happens  to  the  system  past  this 
state.  In  fact,  from  this  starting  state,  the  system  can  reach  any  consistent  intertial  state  what- 
ever, except  for  the  rest  state  [x]  =  ZERO,  [v]  =  ZERO. 
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In  a  similar  way,  the  rules  above  allow  us  to  show  results  such  as  the  following: 

Fast  exponential  growth  and  decay:  The  initial  value  problem  (IVP)  8x=  LARGE[x], 
[;r(0)]  =  MEDIUM,  spends  a  SMALL  time  with  [x]  =  MEDIUM,  and  then  goes  to  [x]  = 
LARGE  forever.  The  problem  ax=  -  LARGE[x],  [x(0)]  =  MEDIUM,  spend  a  SMALL  time 
with  [x\  =  MEDIUM  and  then  goes  to  [x\  =  SMALL.  It  then  either  stays  forever  with  [x]  = 
SMALL,  or  it  transits  after  a  period  of  undetermined  length  to  [x]  =  ZERO,  where  it  stays 
forever. 

Slow  exponential  growth  and  decay:  The  IVP  dx  =  SMALL[x],  [x(0)]=  MEDIUM, 
spends  a  LARGE  time  with  [x]  =  MEDIUM,  and  then  goes  to  [x]  =  LARGE  forever.  The 
problem  ax= -SMALL[x],  [x(0)]  =  MEDIUM,  spend  a  LARGE  time  with  [x]  =  MEDIUM 
and  then  goes  to  [x]  =  SMALL.  It  then  either  stays  forever  with  [x]  =  SMALL,  or  it  transits 
after  a  period  of  undetermined  length  to  [x]  =  ZERO,  where  it  stays  forever. 

Highly  damped  oscillation:  The  IVP 

ax=[v]  [*(0)]=  MEDIUM 

8v=-LARGE-[v]-[x]        [v(0)]  =  SMALL 

corresponding  to  a  very  heavily  damped  harmonic  oscillator,  transits  immediately  to  [x]  = 
MEDIUM,  [v]  =  SMALL.  It  stays  thus  for  a  LARGE  time,  (8v  can  in  the  meantime  take  on 
a  wide  range  of  values)  and  eventually  transits  to  the  state  [x]  =  SMALL  [v]= -SMALL, 
From  here  on  in,  it  can  oscillate  indefinitely  cyclically  with  the  absolute  values  of  both  x  and 
V  remaining  SMALL  or  ZERO,  or  it  can  settle  forever  in  any  such  state,  or  in  the  state  [x]  = 
[v]  =  ZERO. 

Quickly  settling  control  parameter:  The  IVP 

du{T)  =   -LARGE[u(7-)]  [m(0)]  =  MEDIUM 

dx{T)  =  -(MEDIUM  +  [u[T)])x(T)        [x(0)]  =  MEDIUM 

represents  a  quickly  settling  function  u{T)  controlling  a  more  slowly  moving  function  x{T). 
The  solution  has  ["(r)]  become  SMALL  or  ZERO  in  SMALL  time,  while  x(r)  remains 
MEDIUM.  Thereafter,  x(7")  undergoes  exponential  decay  with  a  MEDIUM  time  constant, 
eventually  getting  to  SMALL  values  after  LARGE  time.  (It  takes  only  MEDIUM  time  to  get 
to  any  value  in  MEDIUM  but  LARGE  time  to  get  to  any  value  in  SMALL.)  Such  problems 
are  similar  to  those  studied  by  Kuipers  (1987). 

4.    Envisionment  graphs 

We  next  present  an  algorithm  for  constructing  an  envisionment  graph  for  a  given  set  of 
QDE's.  An  envisionment  graph  for  a  set  of  QDE's  is  a  data  structure  that  indicates  the  tran- 
sitions possible  for  solutions  of  the  QDE.  If  hypothesis  2  is  correct,  it  thus  indicates  all  the 
finite  characterizations  of  parts  of  solutions.  The  value  of  envisionment  graphs  for  this  class 
of  problems  is  somewhat  limited,  for  two  reasons.  First,  they  are  very  large.  Since  all  initial 
value  problems  have  solutions,  any  assignment  of  qualitative  values  to  the  parameters  must 
be  a  possible  state  of  the  system.  Thus  there  are  at  least  7*  inertial  states  for  a  system  of  Jfc 
parameters;  in  general,  there  will  be  more,  since  some  of  the  QDE  functions  G,  will  be 
multi-valued.  In  practice,  this  limits  the  generation  of  envisionments  to  QDE's  with  at  most 
three  variables. 

Second,  the  characterization  of  a  behavior  over  all  time  may  be  considerably  richer  than 
its  behavior  over  any  finite  sequence  of  qualitative  states.  For  example,  consider  the 
(second-order)  IVP 

[i{T)\  =  -[x(7)]  [x(0)]  =  MEDIUM        fx(0)]  =  ZERO 

[U{T)]  =  MEDIUM        [«(0)]  =  MEDIUM 

We  would  like  to  say  that  x  oscillates  forever  in  MEDIUM  amounts  of  time,  while  u  attains 
LARGE  values  over  LARGE  amounts  of  time.    However,  nothing  about  finite  sequences 
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will  ever  say  this.  In  fact,  there  is  no  allowable  transition  in  which  u  goes  from  MEDIUM  to 
LARGE. 

In  this  case,  of  course,  it  is  possible  to  separate  the  two  equations,  and  give  separate 
histories,  along  the  lines  suggested  in  [Williams,  86].  However,  it  is  not  clear  how  this  can 
be  done  in  general;  for  example,  how  it  could  be  done  for  the  equations 


[xiT)]=-[u{T)][x{T)] 
[u(r)]=  MEDIUM  +  [x] 


[x(0]=[i(0)]=  SMALL 
[u(0)]  =  MEDIUM 


Despite  these  difficulties,  we  have  studied  the  construction  of  cnvisionment  graphs  as  a 
natural  and  simple  first  step  toward  deeper  analysis.  We  will  discuss  possible  directions  for 
improvement  in  section  6.  (Forbus  (1987)  gives  a  defence  of  building  cnvisionments  for  con- 
ventional QDE's.  There,  the  first  problem  is  much  alleviated,  since  the  number  of  qualitative 
values  is  smaller,  and  the  second  problem  can  be  ruled  out.) 

We  will  construct  an  cnvisionment  graph  whose  nodes  are  intertial  states,  labelled  by 
sets  of  possible  variances  and  durations,  and  whose  directed  arcs  are  transitions,  labelled  by 
sets  of  the  possible  variances  and  durations  that  the  end  nodes  can  take,  if  the  transition 
occurs.  For  example,  figure  2  shows  a  piece  of  a  graph,  with  nodes  A  and  B,  and  a  transition 
from  A  to  B.  The  labels  on  A  mean  that  [X]  is  MEDIUM  in  A,  dX  is  MEDIUM,  AX  is  either 
SMALL  or  MEDIUM,  and  AT"  is  either  SMALL  or  MEDIUM.  The  labels  on  B  mean  that 
[X]  is  LARGE  in  B,  that  dX  is  SMALL,  that  AX  is  LARGE,  and  that  AT  is  LARGE.  The 
labels  on  the  arc  indicate  that,  if  a  behavior  goes  from  A  to  B,  then  it  will  have  AX  and  AT" 
equal  to  MEDIUM  in  A,  and  AX  and  AT  equal  to  LARGE  in  B. 


A-->  B 


B 


Figure  2:  A  labelled  transition 
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Algorithm  1  constructs  an  envisionment  graph  from  a  set  of  QDE's. 


Input:  A  set  of  QDE's  on  Jt  variables 
Output:  An  envisionment  graph 
begin 

1.  For  each  Jt-tuple  of  parameter  values,  find  the  possible  derivative  values  from  the 
QDE's.  Construct  all  possible  inertial  states. 

2.  For  each  inertial  state  /  do 

a.  For  each  parameter  p,  initialize  Ap  to  be  all  values  less  than  abs([p])  (Rule  of 
bounded  variance.) 

b.  Initialize  AT  to  be  the  intersection  of  all  values  of  LplBp 

c.  If  LARGE  is  a  possible  value  of  AT  then  construct  transition  from  /  to  the  special 
value  FOREVER,  and  from  FOREVER  to  /,  labelled  with  Ar=  LARGE. 

3.  For  each  pair  of  inertial  states  A  and  B  such  that  a  transition  from  A  to  fi  is  consistent 
with  the  rules  of  continuity,  mean  value,  and  temporal  topology, 

a.  Construct  a  transition  arc  A-fl  from  A  to  B. 

b.  For  each  parameter  p,  if  [p]  is  different  in  A  than  in  B,  then  mark  the  in  value  and 
out  value  of  Ap  on  the  arcs  as  the  value  of  abs([p])  in  A  and  B  respectively. 

4.  We  must  now  shrink  the  labels  on  each  node  and  arc  to  N  so  as  to  satisfy  the  following 
conditions: 

a.  For  each  node  A  and  parameter  p,  the  values  of  AT"  must  be  a  subset  of  Ap/dp, 
and  the  values  of  Ap  must  be  a  subset  of  ATdp. 

b.  Any  value  of  AT  or  Ap  in  A  must  be  a  in-value  in  some  arc  A~B  and  an  out-value 
in  some  arc  Q~A.    (Note  that  B  and  Q  here  can  be  FOREVER.) 

c.  Any  in-value  (out-value)  of  AT"  or  Ap  in  the  arc  A-fl  must  be  a  possible  value  in  A 
(B). 

d.  Any  transition  which  has  no  possible  values  for  AT  or  Ap  is  an  impossible  transi- 
tion. 

We  can  achieve  these  conditions  by  a  standard  constraint  propagation  technique  [Waltz, 
75]:  Keep  shrinking  the  labels  to  accomodate  the  conditions,  and  eliminate  transitions 
that  have  no  consistent  labels.  Propagate  the  effect  of  each  change  until  no  more 
changes  occur. 


end; 


Algorithm  1:  Constructing  an  Envisionment  Graph 


Algorithm  1  is  clearly  a  sound  algorithm;  that  is,  if  x(t)  is  a  possible  solution  to  the 
QDE's  for  all  t,  then  any  transition  of  x(t)  is  represented  in  the  graph.  Soundness  holds  since 
the  algorithm  does  nothing  more  than  enforce  the  starting  QDE's  and  the  rules  governing 
transitions.  If  hypothesis  2  is  correct,  then  the  algorithm  is  also  complete;  that  is,  any  path 
through  the  graph  represents  the  behavior  of  some  solution  of  the  QDE's  over  an  interval. 
This  may  be  somewhat  surprising,  since  constraint  propagation  is  not,  in  general  complete. 
However,  the  significance  of  labels  in  this  graph  is  different  than  for  most  labelled  graphs.  If 
one  goes  around  a  cycle  in  the  envisionment  graph  and  comes  back  to  the  same  node,  it  need 
not  have  the  same  value  as  before,  since  it  corresponds  to  the  behavior  of  the  function  at  a 
different  time.  Therefore,  in  this  context,  local  consistency  of  the  graph  is  enough  to  establish 
its  global  consistency. 
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By  indexing  inertial  states  in  an  array  by  the  tuple  of  parameter  values,  step  3  of  the 
algorithm,  which  connects  bordering  states,  can  be  made  reasonably  efficient.  It  does  not 
involve  comparing  all  pairs  of  inertial  states,  but  only  states  that  differ  by  0  or  ±  1  in  each  of 
the  k  parameter  values  and  the  k  derivatives.  It  can  be  shown  that  the  procedure  above  runs 
in  time  at  most  quadratic  in  the  number  of  inertial  states  (which  is,  of  course,  itself  exponen- 
tial in  the  number  of  variables.) 

5.    The  CHEPACHET  Program 

The  above  algorithm  has  been  implemented  as  a  program  named  CHEPACHET. 
CHEPACHET  is  written  in  Franz  Lisp  and  runs  on  a  VAX/780.  It  is  roughly  600  lines  of 
code  long,  and  contains  85  function  definitions,  not  including  utility  macros  and  comments. 
It  has  not  been  compiled,  and  has  not  been  particularly  optimized  for  efficiency.  Input  to 
CHEPACHET  is  a  list  of  parameters  and  a  list  of  QDE's,  whose  left  side  is  the  derivative  of 
a  parameter,  and  whose  right  side  is  a  polynomial  in  the  parameters  and  the  qualitative 
values.  The  output  of  CHEPACHET  is  a  listing  of  the  graph. 


Equations 

Run  time 

No.  inertial  states 

No.  transitions 

dx=M 

1.2 

7 

6 

ax=[x] 

1.6 

7 

6 

dx=-[x] 

1.7 

7 

6 

dx  =  [x]+M 

2.0 

11 

14 

dx=L\x] 

2.1 

11 

14 

dx  =  S\x] 

2.35 

11 

14 

ax€{-L,-M,-S,Z,S,M,L} 

10.2 

49 

128 

dx  =  M 

11.4 

49 

54 

dx=\y] 

dy  =  \x] 

13.9 

49 

60 

dx=[-y] 
dy=[-x] 

14.4 

49 

60 

dx  =  y 
dy=  —X 

11.5 

49 

48 

dx  =  y 
ay=-LUl 

20.2 

77 

116 

dx  =  y 
dy=-S-[x] 

24.3 

77 

148 

dx  =  y 
ay--L\y]-\x] 

40.6 

113 

272 

dx=-L-[x] 

dy=-\xV\y] 

52.1 

109 

324 

dx  =  M 
dy  =  M 
dz=M 

108.0 

343 

414 

Table  2:  Results  of  CHEPACHET 


Table  2  shows  the  results  of  running  CHEPACHET  on  a  number  of  simple  QDE's  in 
terms  of  the  CPU  time  used  (in  CPU  seconds,  exclusive  of  garbage  collection  time),  and  the 
size  of  the  envisionment  graph  generated.  Garbage  collection  time  was  typically  one-quarter 
to  one-third  of  the  other  CPU  time. 
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6.  Further  Work 

Clearly,  the  above  analysis  and  the  CHEPACHET  programs  are  only  first  steps.  The 
theory  will  have  to  be  considerably  refined  before  it  is  of  any  practical  value.  Some  important 
issues  to  be  addressed  include  the  following: 

1.  How  can  this  theory  be  used  in  a  goal-directed  way,  so  that  useful  information,  such 
as  illustrated  in  section  3,  can  be  extracted  without  generating  the  whole  envisionment  graph? 
It  would  be  relatively  easy  to  fix  the  algorithm  to  generate  only  the  states  that  can  be  attained 
from  a  given  set  of  initial  conditions,  but  that  would  not  necessarily  improve  matters  much. 
In  the  problem  of  the  heavy  brick  on  a  spring,  every  state  but  one  (the  rest  state)  can  be 
reached  from  the  starting  conditions.  Many  of  the  states  generated  in  a  complete  envision- 
ment, such  as  the  SMALL  values  that  always  lie  between  MEDIUM  and  ZERO  values,  are 
not  particularly  significant  for  any  kind  of  inference.  Can  they  be  suppressed? 

2.  As  discussed  in  section  3,  information  is  lost  when  a  higher-order  qualitative  dif- 
ferential equation  is  converted  into  a  system  of  first-order  equations.  Is  there  any  easy  way  to 
extract  that  information  working  directly  with  the  higher  order  equations?  ([De  Kleer  and 
Bobrow,  84]  is  the  only  substantial  study  of  higher-order  QDE's) 

3.  How  can  behaviors  involving  infinitely  many  state  transitions  in  bounded  time  be 
characterized? 

4.  Can  the  theory  be  extended  in  a  reasonable  way  to  handle  more  levels  of  orders  of 
magnitude?  Currently,  once  a  parameter  and  its  derivative  both  become  SMALL  or  both 
become  LARGE,  it  becomes  impossible  to  say  anything  at  all  about  their  relative  sizes.  A 
more  flexible  representation  might  support  much  more  powerful  inferences. 
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